# Quantization Optimization
Gemma 2 9b It Abliterated GGUF
A quantized version based on Gemma 2.9B, optimized using llama.cpp, suitable for running in LM Studio.
Large Language Model English
G
bartowski
3,941
37
Gemma 3 12B It Qat GGUF
Gemma 3 12B IT is a large language model developed by Google, supporting multimodal input and long-context processing.
Image-to-Text
G
lmstudio-community
36.65k
4
Elastic Llama 3.1 8B Instruct
Apache-2.0
An elastically optimized version of Meta-Llama-3.1-8B-Instruct, offering model variants with different speed and precision levels, suitable for self-deployment scenarios.
Large Language Model
E
TheStageAI
125
3
Qwen Ai Research Qa Q4 K M.gguf
MIT
A Q&A model specifically designed for answering research-oriented AI questions, optimized with Q4_K_M quantization format to achieve efficient reasoning while maintaining high-quality responses.
Large Language Model English
Q
InduwaraR
29
2
Llava 1.6 Mistral 7b Gguf
Apache-2.0
LLaVA is an open-source multimodal chatbot, trained by fine-tuning LLM on multimodal instruction-following data. This version is the GGUF quantized version, offering multiple quantization options.
Text-to-Image
L
cjpais
9,652
106
Multilingual E5 Small Optimized
MIT
This is the quantized version of multilingual-e5-small, optimized for inference performance through layer-wise quantization while retaining most of the original model's quality.
Text Embedding Supports Multiple Languages
M
elastic
201
15
Featured Recommended AI Models